SOME METHODS OF COMPARING SOCIOMETRIC MATRICES¹


Franz E. Hohn

Department  of  Mathematics,  University  of  Illinois 

Overview 

An approach often used in sociometric investigations is to have each of the members of a group of n individuals rank all the others according to some characteristic such as wisdom, leadership ability, likeability, etc., with the object of studying, and perhaps improving, the structure of the group with respect to this relationship. One of the great desiderata in this connection is a fruitful method of comparing results obtained from the same group (a) at different times but with respect to the same characteristic or (b) at the same time but with respect to different characteristics. Again, when the members of two groups of the same size are in one-to-one correspondence, it may be desirable to compare results obtained from the two groups with respect to the same characteristic. This paper presents two measures which may be useful in making comparisons such as these.

For the investigation of what is called here the hierarchical structure of a group, this paper introduces first a function "h" of the ranks. This h, called the hierarchy index, ranges from the value 0 when the members of the group are indicated by the data to be "equal" with respect to the characteristic in question, to the value 1 when the most extreme type of hierarchical relationship appears. Differences between the h-values obtained from two sets of observations, whether from the same group or from different groups, may be tested for significance.

¹This is Technical Report No. 5, prepared under contract No. N6ori-07135 between the Office of Naval Research and the College of Education of the University of Illinois.

The paper next introduces a coefficient of agreement "θ" as another method of comparing the group structures associated with two sets of data from groups of the same size. The coefficient θ ranges from -1 when the data display opposite hierarchical characteristics, to +1 when they display the same hierarchical characteristics.

The measures θ and h appear to be of considerable sensitivity. However, their usefulness in sociometric research will depend ultimately on the significance of the concepts on which they are based and on the appropriateness of the manner in which the measures arithmetize those concepts. These issues can be decided only by the test of actual use of the measures.

1. Assumptions and Definitions. Let us assume that in a group of n individuals, each member ranks the other n-1 members, from highest to lowest, according to some agreed-on characteristic. More specifically, we assume that each member assigns the other members the ranks

$$n-1,\ n-2,\ \ldots,\ 3,\ 2,\ 1$$

in some order, so that in fact no two individuals are assigned the same rank by any one member. It is convenient to assign the members of the group the names "1", "2", ..., "n". Then the ranking data may be recorded systematically in the form of a square table. Here the entries 1, 2, ..., n in the outer column are the names of the members assigning the ranks and the entries in the top row are the names of the members being ranked.

It is desirable for the purposes of what follows to enter a zero in the first row and first column, the second row and second column, and so on throughout the table. This is as though each individual were required to assign himself the rank zero. If we denote the entry in the j-th row and k-th column of the body of the table by $s_{jk}$, then

$$s_{jk} = \begin{cases} p & \text{if "j" assigns "k" the rank } p \quad (j \neq k), \\ 0 & \text{if } j = k. \end{cases} \tag{1.1}$$

We then have the square array of ranks:

$$S = \begin{bmatrix} 0 & s_{12} & s_{13} & \cdots & s_{1n} \\ s_{21} & 0 & s_{23} & \cdots & s_{2n} \\ \vdots & \vdots & \vdots & & \vdots \\ s_{n1} & s_{n2} & s_{n3} & \cdots & 0 \end{bmatrix}$$
We shall call this the data matrix S of the group with respect to the characteristic in question. In experimental work, it is important to remember that this data matrix is not necessarily a stable property of the group, but may change drastically with time. Indeed, the proper control of such change might well be a primary objective in some circumstances.

In S we now form the column totals to get the score structure "s" of S:

$$s = (s_1, s_2, \ldots, s_n)$$

where

$$s_k = s_{1k} + s_{2k} + \cdots + s_{nk} = \sum_{j=1}^{n} s_{jk}. \tag{1.2}$$

The total score, $s_k$, received by the k-th individual will be called simply his score. Two score structures differing at most in the order of the integers appearing in them will be called similar.

The data matrix S associated with a given group is not uniquely defined since a different assignment of "names" to the members of the group results in a symmetrical permutation of the rows and columns of S. It is a matter of definition that this does not change the structure of the group. For example, if n = 3, we might obtain for one method of naming the members of the group, the data matrix

$$S = \begin{bmatrix} 0 & 2 & 1 \\ 2 & 0 & 1 \\ 2 & 1 & 0 \end{bmatrix}$$

If now we interchange the names of members "1" and "3", we in effect interchange the first and third rows as well as the first and third columns of S. We thus obtain the data matrix

$$S_1 = \begin{bmatrix} 0 & 1 & 2 \\ 1 & 0 & 2 \\ 1 & 2 & 0 \end{bmatrix}$$

Here S has the score structure (4, 3, 2) and $S_1$ the score structure (2, 3, 4). It will be noted that S and $S_1$, like any two data matrices differing only because of different arrangement of the members of the group, have score structures which are similar according to our previous definition. The converse is not true, however. That is, the data matrices of groups having similar score structures cannot always be transformed one into the other simply by rearranging the individuals of the group.
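The column totals of (1.2) and the renaming operation just described can be checked mechanically. The following is a minimal Python sketch added for illustration, not part of the original report; the function names `score_structure`, `rename` and `similar` are assumptions of this sketch, and members are indexed from 0 rather than from 1.

```python
from typing import List, Sequence


def score_structure(S: Sequence[Sequence[int]]) -> List[int]:
    """Column totals s_k = sum over j of s_jk, as in (1.2)."""
    n = len(S)
    return [sum(S[j][k] for j in range(n)) for k in range(n)]


def rename(S: Sequence[Sequence[int]], perm: Sequence[int]) -> List[List[int]]:
    """Symmetric permutation of rows and columns.

    perm[a] is the old (0-based) name of the member whose new name is a.
    """
    n = len(S)
    return [[S[perm[a]][perm[b]] for b in range(n)] for a in range(n)]


def similar(s: Sequence[int], t: Sequence[int]) -> bool:
    """Score structures are similar if they differ at most in order."""
    return sorted(s) == sorted(t)


# The n = 3 data matrix S of the text.
S = [[0, 2, 1],
     [2, 0, 1],
     [2, 1, 0]]

# Interchange the names of members "1" and "3" (indices 0 and 2).
S1 = rename(S, [2, 1, 0])

print(score_structure(S))    # [4, 3, 2]
print(S1)                    # [[0, 1, 2], [1, 0, 2], [1, 2, 0]]
print(score_structure(S1))   # [2, 3, 4]
print(similar(score_structure(S), score_structure(S1)))  # True
```

Representing a renaming as a list of old indices keeps the row and column permutations tied together, which is what makes the permutation "symmetrical" in the sense of the text.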

From the definition of the ranking operation, it is clear that one, but not more than one, of the $s_k$'s can assume the value $(n-1)^2 = (n-1)\cdot(n-1)$ and that no higher score is attainable. Similarly, one of the $s_k$'s, but not more than one, can assume the value $(n-1)\cdot 1$, which is the lowest score attainable. The scores $s_k$ have the total

$$\sum_{k=1}^{n} s_k = n\,\bigl[(n-1) + (n-2) + \cdots + 2 + 1\bigr] = \frac{n^2(n-1)}{2} \tag{1.3}$$

since the expression in brackets is the sum of the entries in any one of the n rows. It follows that each score structure is a partitioning of $n^2(n-1)/2$ into n parts $s_k$ such that

$$n-1 \le s_k \le (n-1)^2, \qquad k = 1, 2, \ldots, n,$$

and such that not more than one $s_k$ equals $(n-1)^2$ and not more than one equals $n-1$.

The number of distinct data matrices S which are possible for a given value of n is rather large even when n is small. In fact, there are (n-1)! ways of filling the off-diagonal positions in each row, so that there are $[(n-1)!]^n$ such matrices possible. For n = 3, this is $(2!)^3 = 8$, but for n = 4 it is $(3!)^4 = 1296$.
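The count can be confirmed by brute-force enumeration. The sketch below is an illustration added here, under the assumption that a data matrix is stored as a tuple of rows with members indexed from 0; the name `all_data_matrices` is hypothetical.

```python
from itertools import permutations, product
from math import factorial
from typing import Iterator, Tuple

Matrix = Tuple[Tuple[int, ...], ...]


def all_data_matrices(n: int) -> Iterator[Matrix]:
    """Yield every n x n data matrix for a group of n members."""
    rows_for_member = []
    for j in range(n):
        rows = []
        for ranks in permutations(range(1, n)):
            row = list(ranks)
            row.insert(j, 0)  # member j assigns himself the rank zero
            rows.append(tuple(row))
        rows_for_member.append(rows)
    # One independent choice of row per member.
    return product(*rows_for_member)


for n in (3, 4):
    count = sum(1 for _ in all_data_matrices(n))
    print(n, count, factorial(n - 1) ** n)
```

The enumeration grows as $[(n-1)!]^n$, so it is practical only for very small n; for n = 5 there are already 24^5 = 7,962,624 matrices.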

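By way of illustrating some of the ideas so far presented, we list all possible data matrices for the case n = 3. The matrices

$$\begin{bmatrix} 0 & 2 & 1 \\ 1 & 0 & 2 \\ 2 & 1 & 0 \end{bmatrix} \quad\text{and}\quad \begin{bmatrix} 0 & 1 & 2 \\ 2 & 0 & 1 \\ 1 & 2 & 0 \end{bmatrix}$$

each has the score structure (3,3,3). Also, interchanging the names of individuals 2 and 3 will reduce one matrix to the other. The matrices

$$\begin{bmatrix} 0 & 2 & 1 \\ 2 & 0 & 1 \\ 2 & 1 & 0 \end{bmatrix},\quad \begin{bmatrix} 0 & 1 & 2 \\ 2 & 0 & 1 \\ 2 & 1 & 0 \end{bmatrix},\quad \begin{bmatrix} 0 & 2 & 1 \\ 2 & 0 & 1 \\ 1 & 2 & 0 \end{bmatrix},\quad \begin{bmatrix} 0 & 1 & 2 \\ 1 & 0 & 2 \\ 2 & 1 & 0 \end{bmatrix},\quad \begin{bmatrix} 0 & 2 & 1 \\ 1 & 0 & 2 \\ 1 & 2 & 0 \end{bmatrix},\quad \begin{bmatrix} 0 & 1 & 2 \\ 1 & 0 & 2 \\ 1 & 2 & 0 \end{bmatrix}$$

all have a score structure similar to (4,3,2) and again, any one may be obtained from any other by symmetric permutation of the rows and columns.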

This last mentioned property does not persist beyond n = 3. Indeed, the matrices

$$\begin{bmatrix} 0 & 2 & 3 & 1 \\ 3 & 0 & 2 & 1 \\ 3 & 2 & 0 & 1 \\ 2 & 3 & 1 & 0 \end{bmatrix} \quad\text{and}\quad \begin{bmatrix} 0 & 3 & 2 & 1 \\ 3 & 0 & 2 & 1 \\ 2 & 3 & 0 & 1 \\ 3 & 1 & 2 & 0 \end{bmatrix}$$

both have score structure (8,7,6,3) but one cannot be reduced to the other by symmetrical permutation of the rows and columns. In the first the four 3's are distributed among three individuals, whereas in the second they are distributed between two individuals.

The large number of data matrices possible for given n emphasizes the importance of the score structure as a method of condensing the information contained in the matrices. When n = 4 we have 1296 matrices, as noted above, but there are only 16 score structures, as will appear later.

2. The Hierarchy Index. Among the score structures two extreme cases are of particular interest. The equality is the structure with

$$s_1 = s_2 = \cdots = s_n = \frac{n(n-1)}{2}, \tag{2.1}$$

that is, the case when every member of the group has the same score. An example of this is given by the matrix:

$$\begin{bmatrix} 0 & n-1 & \cdots & 1 \\ 1 & 0 & \cdots & 2 \\ 2 & 1 & \cdots & 3 \\ 3 & 2 & \cdots & 4 \\ \vdots & \vdots & & \vdots \\ n-2 & n-3 & \cdots & n-1 \\ n-1 & n-2 & \cdots & 0 \end{bmatrix}$$

At the other extreme is the score structure which we call the extreme hierarchy. In the case of this score structure, we have one individual with the highest possible score, thereafter one with the resulting next-highest possible score, and so on. For example we may have

$$\begin{bmatrix} 0 & n-1 & n-2 & \cdots & 2 & 1 \\ n-1 & 0 & n-2 & \cdots & 2 & 1 \\ n-1 & n-2 & 0 & \cdots & 2 & 1 \\ \vdots & \vdots & \vdots & & \vdots & \vdots \\ n-1 & n-2 & n-3 & \cdots & 0 & 1 \\ n-1 & n-2 & n-3 & \cdots & 1 & 0 \end{bmatrix}$$

with

$$\left\{\begin{aligned} s_1 &= (n-1)(n-1) \\ s_2 &= (n-2)(n-2) + 1\cdot(n-1) \\ &\ \ \vdots \\ s_k &= (n-k)(n-k) + (k-1)(n-k+1) \\ &\ \ \vdots \\ s_{n-1} &= 1\cdot 1 + (n-2)\cdot 2 \\ s_n &= (n-1)\cdot 1 \end{aligned}\right. \tag{2.2}$$

Any score structure similar to (2.2) is also an example of the extreme hierarchy. A little thought will make it clear that any data matrix whose score structure is an extreme hierarchy can be reduced to the form given above by suitable numbering of the members of the group. In the extreme hierarchy (2.2) we have $s_k - s_{k+1} = n-2$, so that the scores of s are equally spaced n-2 units apart, beginning with $(n-1)^2$ and ending with n-1. It is thus a simple matter to write down the score structure for an extreme hierarchy when n is given.
In order to obtain a measure of where a given score structure falls between the equality and the extreme hierarchy, we note first that the mean score of a score structure is

$$\frac{1}{n}\sum_{k=1}^{n} s_k = \frac{n(n-1)}{2} \tag{2.3}$$

and that the variance is therefore

$$\text{Var} = \frac{1}{n}\sum_{k=1}^{n}\left(s_k - \frac{n(n-1)}{2}\right)^2. \tag{2.4}$$

The maximum variance of the s's will evidently be obtained when they have the values listed in (2.2). The variance in this case is given by

$$\text{Var}_{\max} = \frac{1}{n}\sum_{k=1}^{n}\left[(n-k)^2 + (k-1)(n-k+1) - \frac{n(n-1)}{2}\right]^2 = \frac{(n^2-1)(n-2)^2}{12}. \tag{2.5}$$

We may therefore define a hierarchy index, h, as the ratio of the actually observed variance to the maximum possible variance. From (2.4) and (2.5), we see then that

$$h = \frac{12\sum_{k=1}^{n}\left(s_k - \dfrac{n(n-1)}{2}\right)^2}{n(n^2-1)(n-2)^2}. \tag{2.6}$$

This reduces without difficulty to the form:

$$h = \frac{12\sum_{k=1}^{n} s_k^2 - 3n^3(n-1)^2}{n(n^2-1)(n-2)^2}. \tag{2.7}$$

From the definition it follows at once that the minimum value of h is zero. This value is obtained when the score structure is the equality. Similarly, the maximum value of h is 1, which is obtained when the score structure is an extreme hierarchy.
It should be noted that we can have a hierarchy in the sense that

$$s_1 > s_2 > \cdots > s_n,$$

for example, without having the extreme case defined by (2.2). For such a hierarchy, h < 1. An example is given by the data matrix

$$\begin{bmatrix} 0 & 4 & 3 & 1 & 2 \\ 4 & 0 & 3 & 2 & 1 \\ 4 & 3 & 0 & 2 & 1 \\ 4 & 3 & 2 & 0 & 1 \\ 3 & 4 & 2 & 1 & 0 \end{bmatrix}$$

with s = (15, 14, 10, 6, 5), h = 41/45 = .91+. However, if we have an extreme hierarchy where n = 5, the matrix has a score structure similar to s = (16, 13, 10, 7, 4), with h = 1, of course.
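The hierarchy index is the observed variance of the scores divided by the maximum variance $(n^2-1)(n-2)^2/12$, so it is easy to compute exactly. The following Python sketch is an illustration added here, not the author's procedure; the names `extreme_hierarchy` and `hierarchy_index` are assumptions, and exact rational arithmetic is used so that values such as 41/45 come out without rounding.

```python
from fractions import Fraction
from typing import List, Sequence


def extreme_hierarchy(n: int) -> List[int]:
    """Scores (2.2): s_k = (n-k)^2 + (k-1)(n-k+1) for k = 1, ..., n."""
    return [(n - k) ** 2 + (k - 1) * (n - k + 1) for k in range(1, n + 1)]


def hierarchy_index(s: Sequence[int]) -> Fraction:
    """Observed variance of the scores over the maximum possible variance."""
    n = len(s)
    if n < 3:
        raise ValueError("the maximum variance is zero for n < 3")
    mean = Fraction(n * (n - 1), 2)                  # mean score, (2.3)
    variance = sum((x - mean) ** 2 for x in s) / n   # variance, (2.4)
    max_variance = Fraction((n * n - 1) * (n - 2) ** 2, 12)  # (2.5)
    return variance / max_variance


print(extreme_hierarchy(5))                         # [16, 13, 10, 7, 4]
print(hierarchy_index(extreme_hierarchy(5)))        # 1
print(hierarchy_index([15, 14, 10, 6, 5]))          # 41/45
print(float(hierarchy_index([15, 14, 10, 6, 5])))   # about 0.911
print(hierarchy_index([10, 10, 10, 10, 10]))        # 0
```

The function does not check that its argument is attainable from some data matrix; it only evaluates the ratio for whatever scores it is given.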

Another interesting special structure occurs only when n is even, because the n-1 members other than the leader can share the remaining score equally only when n-1 is odd. With an even number of persons, we can have

$$s_1 = (n-1)^2, \qquad s_2 = s_3 = \cdots = s_n = \frac{n^2 - 2n + 2}{2},$$

so that if n > 2, we have one leader, the other members of the group constituting among themselves an equality. Such a score structure may be called an extreme leadership. Denoting h by $h_1$ in this case, we find

$$h_1 = \frac{3}{n+1},$$

which approaches 0 for large values of n. Even though the value of $h_1$ is small, an extreme leadership might have great social importance.

This demonstrates that h is not to be regarded as a measure of the possible social importance of a given score structure. It is only a measure of resemblance to the extreme hierarchy.
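The value 3/(n+1) can be checked for particular even n. The sketch below is an added illustration and rests on an assumption about the extreme leadership: the leader holds the highest attainable score $(n-1)^2$ and the remaining total from (1.3) is divided equally among the other n-1 members. The names `leadership_scores` and `hierarchy_index` are hypothetical.

```python
from fractions import Fraction
from typing import List, Sequence


def leadership_scores(n: int) -> List[int]:
    """Assumed extreme-leadership structure: one leader, the rest equal."""
    if n % 2 != 0 or n < 4:
        raise ValueError("this sketch assumes an even n >= 4")
    total = n * n * (n - 1) // 2      # total of all scores, formula (1.3)
    leader = (n - 1) ** 2             # highest attainable score
    share, remainder = divmod(total - leader, n - 1)
    assert remainder == 0             # the equal share is an integer for even n
    return [leader] + [share] * (n - 1)


def hierarchy_index(s: Sequence[int]) -> Fraction:
    """Observed variance of the scores over the maximum possible variance."""
    n = len(s)
    mean = Fraction(n * (n - 1), 2)
    variance = sum((x - mean) ** 2 for x in s) / n
    return variance / Fraction((n * n - 1) * (n - 2) ** 2, 12)


for n in (4, 6, 8, 10):
    s = leadership_scores(n)
    print(n, s, hierarchy_index(s), Fraction(3, n + 1))
```

For n = 4 this gives the scores 9, 5, 5, 5 with h = 3/5, and the index shrinks as the group grows even though one member still receives every top rank.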
To give a final illustration of the behavior of h, we list in Table 1 the 16 score structures for n = 4 and the corresponding values of h.

TABLE 1

SCORE STRUCTURES AND VALUES OF h FOR n = 4

SCORE STRUCTURE*       h        SCORE STRUCTURE       h
9753                   1        8754                  5/10
9744, 9663, 8853       9/10     8664                  4/10
8844                   8/10     8655, 7764            3/10
9654, 8763             7/10     7755                  2/10
9555, 7773             6/10     7665                  1/10
                                6666                  0

It is easy to verify that there actually are data matrices giving rise to each of the listed score structures.
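One way to carry out this verification is to enumerate all 1296 data matrices for n = 4 and tabulate their score structures. The Python sketch below is an added illustration; the helper names are assumptions, score structures are recorded largest score first, and exact fractions are used for h.

```python
from collections import Counter
from fractions import Fraction
from itertools import permutations, product


def all_data_matrices(n):
    """Yield every n x n data matrix (zero diagonal, ranks 1..n-1 in each row)."""
    rows_for_member = []
    for j in range(n):
        rows = []
        for ranks in permutations(range(1, n)):
            row = list(ranks)
            row.insert(j, 0)
            rows.append(tuple(row))
        rows_for_member.append(rows)
    return product(*rows_for_member)


def hierarchy_index(s):
    """Observed variance of the scores over the maximum possible variance."""
    n = len(s)
    mean = Fraction(n * (n - 1), 2)
    variance = sum((x - mean) ** 2 for x in s) / n
    return variance / Fraction((n * n - 1) * (n - 2) ** 2, 12)


def structure_counts(n):
    """Map each score structure (largest score first) to its number of matrices."""
    counts = Counter()
    for S in all_data_matrices(n):
        scores = [sum(column) for column in zip(*S)]
        counts[tuple(sorted(scores, reverse=True))] += 1
    return counts


counts = structure_counts(4)
ordered = sorted(counts.items(),
                 key=lambda item: (-hierarchy_index(item[0]), item[0]))
for s, matrices in ordered:
    print("".join(map(str, s)), hierarchy_index(s), matrices)
print(len(counts), "score structures from", sum(counts.values()), "matrices")
```

Grouping by the sorted scores treats similar score structures as one, which is the sense in which the text counts 16 of them.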

3. Comparison of Score Structures. It is desirable for certain purposes to compare matrices of rankings such as described above, which are obtained at different times, or with respect to different characteristics, from the same group. Suppose, for example, that the experience and education of a group were designed so as to produce eventually a group with the extreme hierarchy as its structure. Progress toward this end could be measured by observing the progress of h toward the value +1.

Suppose, on the other hand, that we wish to compare two groups of individuals in which the members of the groups can be placed in a logical one-to-one correspondence. Here corresponding members of the two groups would be assigned the same number "name", of course. As a pair of such groups we could use two baseball teams, for example. Then the score structures of the two groups may be compared simply by computing the product-moment correlations of these score structures. For fixed n, the correlation coefficient in this case takes on one of a finite set of values.

*In this and following tables, commas are omitted from the score structures.
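Let $s = (s_1, s_2, \ldots, s_n)$ and $s' = (s'_1, s'_2, \ldots, s'_n)$ be two score structures obtained from the same group or from two groups, under the hypotheses stated above. We then define the coefficient of agreement, θ, of the two score structures to be

$$\theta = \frac{\operatorname{Cov}(s, s')}{\sqrt{\operatorname{Var}(s)\cdot\operatorname{Var}(s')}}.$$

Since

$$\frac{1}{n}\sum_{k=1}^{n} s_k = \frac{1}{n}\sum_{k=1}^{n} s'_k = \frac{n(n-1)}{2},$$

this may be written as

$$\theta = \frac{\sum_{k=1}^{n}\left(s_k - \dfrac{n(n-1)}{2}\right)\left(s'_k - \dfrac{n(n-1)}{2}\right)}{\sqrt{\sum_{k=1}^{n}\left(s_k - \dfrac{n(n-1)}{2}\right)^2\ \sum_{k=1}^{n}\left(s'_k - \dfrac{n(n-1)}{2}\right)^2}} \tag{3.1}$$

which reduces readily to

$$\theta = \frac{\sum s_k s'_k - \dfrac{n^3(n-1)^2}{4}}{\sqrt{\left[\sum s_k^2 - \dfrac{n^3(n-1)^2}{4}\right]\left[\sum s_k'^2 - \dfrac{n^3(n-1)^2}{4}\right]}}. \tag{3.2}$$

Since either n or n-1 is even, $n^3(n-1)^2/4$ will always be an integer.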
If we put

$$d_k = s_k - s'_k,$$

we have

$$\sum d_k^2 = \sum s_k^2 + \sum s_k'^2 - 2\sum s_k s'_k,$$

so that the formula (3.2), on elimination of $\sum s_k s'_k$, may be written in the form:

$$\theta = \frac{\sum s_k^2 + \sum s_k'^2 - \tfrac12 n^3(n-1)^2 - \sum d_k^2}{\sqrt{\left[2\sum s_k^2 - \tfrac12 n^3(n-1)^2\right]\left[2\sum s_k'^2 - \tfrac12 n^3(n-1)^2\right]}}. \tag{3.3}$$

Since the s's are ordinarily not very large, formula (3.2) is not too inconvenient, especially if a table of products is available. However, (3.3) enables us to work with a table of squares, and even for small n it is apt to be more convenient than (3.2) since $s_k$ may be as large as $(n-1)^2$.

In the event that both s and s' are similar to the score structure of the extreme hierarchy, we have

$$\sum s_k^2 = \sum s_k'^2 = \sum_{k=1}^{n}\left[(n-k)^2 + (k-1)(n-k+1)\right]^2 = \frac{n}{6}\left(2n^4 - 5n^3 + 3n^2 + 2n - 2\right).$$

Then

$$2\sum s_k^2 - \tfrac12 n^3(n-1)^2 = \tfrac16\, n(n^2-1)(n-2)^2$$

and θ assumes the much simpler, special form

$$\theta = 1 - \frac{6\sum d_k^2}{n(n^2-1)(n-2)^2}. \tag{3.4}$$

It is interesting to note that when s is the extreme hierarchy:

$$s = \left((n-1)^2,\ (n-2)^2 + (n-1),\ \ldots,\ (n-k)^2 + (k-1)(n-k+1),\ \ldots,\ n-1\right)$$

and s' is the reverse extreme hierarchy, namely

$$s' = \left(0^2 + (n-1)\cdot 1,\ 1^2 + (n-2)\cdot 2,\ \ldots,\ (k-1)^2 + (n-k)\cdot k,\ \ldots,\ (n-1)^2\right),$$

then $\sum d_k^2$ takes on its maximum value, which is in fact

$$\left(\sum d_k^2\right)_{\max} = \frac{n(n^2-1)(n-2)^2}{3},$$

so that

$$\theta = 1 - 2\,\frac{\sum d_k^2}{\left(\sum d_k^2\right)_{\max}}.$$

We complete the definition of θ as follows. In the event that one or both of s, s' should be the equality, the expression (3.2) for θ is undefined. In order to preserve the symmetry of the distribution of values of θ, we define θ to be zero in such a case.

In order now to illustrate the computation of θ, let us consider a society of five individuals for whom are obtained the three following score structures, say with respect to three different characteristics: s = (16, 13, 10, 7, 4), s' = (15, 14, 10, 6, 5) and s'' = (9, 9, 9, 9, 14). For s and s' we have $\sum d_k^2 = 4$, $\sum s_k^2 = 590$ and $\sum s_k'^2 = 582$. Then

$$\theta(s, s') = \frac{590 + 582 - 1000 - 4}{\sqrt{(2\cdot 590 - 1000)(2\cdot 582 - 1000)}} = \frac{168}{\sqrt{180\cdot 164}} = 0.9778.$$

On the other hand, for s and s'' we have $\sum d_k^2 = 170$, $\sum s_k^2 = 590$ and $\sum s_k''^2 = 520$. Hence, in this case,

$$\theta(s, s'') = \frac{590 + 520 - 1000 - 170}{\sqrt{(2\cdot 590 - 1000)(2\cdot 520 - 1000)}} = \frac{-60}{\sqrt{180\cdot 40}} = -0.7071.$$

These results seem intuitively to be quite satisfactory.
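The computation of the coefficient of agreement for the three score structures above can be reproduced directly. The Python sketch below is an added illustration with hypothetical function names; it evaluates θ from sums of squares and of squared differences in the manner of (3.3), applies the convention that θ is zero when either structure is the equality, and cross-checks the result against an ordinary product-moment correlation. Direct evaluation gives approximately 0.978 for the first pair and -0.707 for the second.

```python
from math import sqrt
from typing import Sequence


def theta(s: Sequence[int], t: Sequence[int]) -> float:
    """Coefficient of agreement from sums of squares and squared differences."""
    n = len(s)
    if len(t) != n:
        raise ValueError("score structures must have the same length")
    c = n ** 3 * (n - 1) ** 2 / 2                  # n^3 (n-1)^2 / 2
    ss = sum(x * x for x in s)
    tt = sum(y * y for y in t)
    dd = sum((x - y) ** 2 for x, y in zip(s, t))
    a, b = 2 * ss - c, 2 * tt - c                  # zero only for the equality
    if a == 0 or b == 0:
        return 0.0                                 # convention for the equality
    return (ss + tt - c - dd) / sqrt(a * b)


def product_moment(s: Sequence[int], t: Sequence[int]) -> float:
    """Ordinary correlation coefficient, used here only as a cross-check."""
    n = len(s)
    ms, mt = sum(s) / n, sum(t) / n
    cov = sum((x - ms) * (y - mt) for x, y in zip(s, t))
    vs = sum((x - ms) ** 2 for x in s)
    vt = sum((y - mt) ** 2 for y in t)
    return cov / sqrt(vs * vt)


s = [16, 13, 10, 7, 4]
s1 = [15, 14, 10, 6, 5]
s2 = [9, 9, 9, 9, 14]

print(round(theta(s, s1), 4))    # 0.9778
print(round(theta(s, s2), 4))    # -0.7071
print(theta(s, [10] * 5))        # 0.0 by convention
```

Working from squares alone mirrors the hand method the text recommends; with a computer the direct correlation is just as convenient, and the two agree to rounding error.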

The measures θ and h are of course related, although the formula is not simple. From (2.7) we have, after a little manipulation,

$$\frac{n(n^2-1)(n-2)^2\,h}{6} = 2\sum s_k^2 - \frac{n^3(n-1)^2}{2}$$

so that, writing a similar formula for h' in terms of $\sum s_k'^2$, we obtain after substitution into (3.3):

$$\theta = \frac{1}{2\sqrt{h h'}}\left[h + h' - \frac{12\sum d_k^2}{n(n^2-1)(n-2)^2}\right] \tag{3.5}$$

in the event that hh' ≠ 0.
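The substitution can be spelled out step by step. The following is a worked sketch added for the reader; the abbreviation D is introduced here for convenience and is not notation of the original report.

```latex
\begin{aligned}
D &= \tfrac{1}{6}\,n(n^2-1)(n-2)^2, \\
2\sum s_k^2 - \tfrac12 n^3(n-1)^2 &= D\,h, \qquad
2\sum s_k'^2 - \tfrac12 n^3(n-1)^2 = D\,h', \\
\sum s_k^2 + \sum s_k'^2 - \tfrac12 n^3(n-1)^2 &= \tfrac12 D\,(h + h'), \\
\theta &= \frac{\tfrac12 D\,(h + h') - \sum d_k^2}{D\sqrt{h h'}}
        = \frac{h + h' - 2\sum d_k^2 / D}{2\sqrt{h h'}} .
\end{aligned}
```

As a check with n = 5, s = (16, 13, 10, 7, 4) and s' = (15, 14, 10, 6, 5), we have D = 180, h = 1, h' = 41/45 and $\sum d_k^2 = 4$, which gives θ ≈ 0.978, in agreement with direct evaluation of the correlation.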

4. Statistical Considerations. In order to evaluate the significance of changes in the score structure of a group, or the significance of a difference in the score structures of two comparable groups, we need certain statistical tables. For instance, when n = 3, we consider Tables 2 and 3. In these, the probabilities for h are computed, on the assumption that the rankings are random phenomena.

TABLE 2

PROBABILITY DISTRIBUTION OF h FOR n = 3

Score        Number of matrices with a      h      Pr(h)     Pr(h ≥ listed value)
structure    similar score structure
432          6                              1      0.75      0.75
333          2                              0      0.25      1

TABLE 3

PROBABILITY TABLE FOR h2 - h1, n = 3

Values of    No. of pairs of matrices giving     Pr(h2 - h1)     Pr(h2 - h1 ≥ listed value)
h2 - h1      listed value of h2 - h1
-1           12                                  3/16            1
 0           40                                  10/16           13/16
+1           12                                  3/16            3/16

Suppose, for example, that as a result of training, h for a group of 3 men rises from 0 to +1. The probability that this is the result of pure chance rankings is 3/16. Suppose next the method of training applied to five such groups results in the same change. The probability that this happens by pure chance is $(3/16)^5 = 0.00023$, so that we may conclude that the method of training is effective in bringing about this change if we set the level of significance at 0.001, for example.
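The arithmetic of this example is short enough to spell out. The sketch below is an added illustration; it assumes, as the text does, that every data matrix is equally likely under chance and that the groups are independent, and the variable names are hypothetical.

```python
from fractions import Fraction

# Counts for n = 3 (Table 2): 6 matrices give h = 1 and 2 give h = 0.
matrices_with_h = {Fraction(1): 6, Fraction(0): 2}
total = sum(matrices_with_h.values())           # 8 matrices in all

# Ordered pairs (first ranking, second ranking) with h rising from 0 to 1.
pairs_rising = matrices_with_h[Fraction(0)] * matrices_with_h[Fraction(1)]
p_one_group = Fraction(pairs_rising, total ** 2)

# Five independent groups all showing the same rise.
p_five_groups = p_one_group ** 5

print(p_one_group)                          # 3/16
print(float(p_five_groups))                 # about 0.00023
print(p_five_groups < Fraction(1, 1000))    # True
```

A single group reaching h = 1 from h = 0 is not remarkable by itself (probability 3/16); the evidence comes from the repetition across groups.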

For n = 3, we may also construct Table 4 for testing in a similar fashion the significance of observed values of θ.

TABLE 4

PROBABILITY DISTRIBUTION OF θ FOR n = 3

Possible values    No. of pairs of matrices     Pr(θ)       Pr(θ ≥ listed value)
of θ               giving listed value of θ
-1                 6                            0.09375     1.0000
-1/2               12                           0.18750     0.90625
 0                 28                           0.43750     0.71875
 1/2               12                           0.18750     0.28125
 1                 6                            0.09375     0.09375
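The entries of Table 4 can be reproduced by enumerating all 64 ordered pairs of data matrices for n = 3. The Python sketch below is an added illustration with hypothetical helper names; it computes θ as a product-moment correlation of the two score structures and sets θ to zero when either structure is the equality, following the convention adopted in Section 3.

```python
from collections import Counter
from itertools import permutations, product
from math import sqrt


def all_data_matrices(n):
    """Yield every n x n data matrix (zero diagonal, ranks 1..n-1 in each row)."""
    rows_for_member = []
    for j in range(n):
        rows = []
        for ranks in permutations(range(1, n)):
            row = list(ranks)
            row.insert(j, 0)
            rows.append(tuple(row))
        rows_for_member.append(rows)
    return product(*rows_for_member)


def scores(S):
    """Score structure: the column totals of the data matrix."""
    return [sum(column) for column in zip(*S)]


def theta(s, t):
    """Coefficient of agreement; zero if either structure is the equality."""
    n = len(s)
    mean = n * (n - 1) / 2
    cov = sum((x - mean) * (y - mean) for x, y in zip(s, t))
    vs = sum((x - mean) ** 2 for x in s)
    vt = sum((y - mean) ** 2 for y in t)
    if vs == 0 or vt == 0:
        return 0.0
    return cov / sqrt(vs * vt)


structures = [scores(S) for S in all_data_matrices(3)]
tally = Counter(round(theta(s, t), 6) for s in structures for t in structures)
pairs = len(structures) ** 2            # 64 ordered pairs

for value in sorted(tally):
    print(value, tally[value], tally[value] / pairs)
```

The 28 pairs with θ = 0 are exactly those in which at least one of the two matrices has the score structure 333.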


For n = 4, we have Tables 5 and 6.

TABLE 5

PROBABILITY DISTRIBUTION OF h FOR n = 4

Score        Number of matrices      h       Number of         Pr(h)      Pr(h ≥ listed value)
structure    with a similar                  matrices with
             score structure                 given h
9753         24                      1       24                0.0185     0.0185
9744         24                      9/10    72                0.0556     0.0741
9663         24
8853         24
8844         24                      8/10    24                0.0185     0.0926
9654         96                      7/10    192               0.1482     0.2407
8763         96
9555         24                      6/10    48                0.0370     0.2778
7773         24
8754         144                     5/10    144               0.1111     0.3889
8664         96                      4/10    96                0.0741     0.4630
8655         144                     3/10    288               0.2222     0.6852
7764         144
7755         120                     2/10    120               0.0926     0.7779
7665         264                     1/10    264               0.2037     0.9815
6666         24                      0       24                0.0185     1.0000

Total        1296                            1296              1.0000

TABLE 6

PROBABILITY TABLE FOR h2 - h1, n = 4
(For negative values of h2 - h1, the probabilities may be found by symmetry considerations.)

h2 - h1      Pr(h2 - h1)      Pr(h2 - h1 ≥ value listed)
1            0.00034          0.00034
9/10         0.0048           0.0051
8/10         0.0134           0.0185
7/10         0.0158           0.0343
6/10         0.0463           0.0806
5/10         0.0336           0.1142
4/10         0.0686           0.1828
3/10         0.0556           0.2383
2/10         0.1070           0.3453
1/10         0.0823           0.4276
0            0.1447           0.5724

These last two tables are introduced primarily to give the reader a little more feeling for the behavior of h and of differences in h's. A similar probability table for the values of θ when n = 4 would not take too long to construct.
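The distribution of the difference of two independent h-values for n = 4 follows from the matrix counts of Table 5 alone. The Python sketch below is an added illustration; the dictionary of counts is transcribed from Table 5 with h expressed in tenths, and the function name `difference_distribution` is hypothetical.

```python
from collections import Counter
from fractions import Fraction

# Number of matrices giving each value of 10*h for n = 4 (from Table 5).
matrices_with_h10 = {10: 24, 9: 72, 8: 24, 7: 192, 6: 48, 5: 144,
                     4: 96, 3: 288, 2: 120, 1: 264, 0: 24}


def difference_distribution(counts):
    """Pr(h2 - h1 = d/10) for two independent, equally likely data matrices."""
    total = sum(counts.values())
    pairs = Counter()
    for h2, c2 in counts.items():
        for h1, c1 in counts.items():
            pairs[h2 - h1] += c2 * c1
    return {d: Fraction(c, total * total) for d, c in pairs.items()}


dist = difference_distribution(matrices_with_h10)

upper_tail = Fraction(0)
for d in range(10, -1, -1):
    upper_tail += dist[d]
    print(f"{d}/10  {float(dist[d]):.5f}  {float(upper_tail):.5f}")
```

Because the two matrices are treated symmetrically, the probability of a difference of -d/10 equals that of +d/10, which is the symmetry consideration mentioned in the heading of Table 6.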

Since experimental n's are apt to be a good deal larger than 4, the application of statistical significance theory to h and θ must await the determination of suitable approximations to their distributions. Our θ seems closely related to Kendall's "coefficient of concordance" W, so that one might reasonably expect adaptations of his methods to yield useful results here.


Discussion by

Lee J. Cronbach

Sociometric methods have been given relatively little study as a formal problem in psychometrics, although a few mathematical treatments of the problem are appearing. Since it appeared probable that a fresh mind, acquainted with matrix algebra, could suggest new analyses of sociometric data, we asked Dr. Hohn to study reports of sociometric studies and to explore whatever leads suggested themselves.

His paper gives a detailed analysis of a particular approach which he calls the hierarchy index. In this comment, I desire to relate his development to comparable procedures used in test psychometrics, and to indicate some possible interpretations.

The first point to be noted is that h, the hierarchy index, is a ratio of variance to maximum variance (V/V max). Such a ratio was once proposed by Ferguson (3) as an index of homogeneity among test items, and is linearly related to the somewhat more familiar index C/C max (C being total interitem covariance) proposed by Loevinger (6). In most studies, these are not superior to coefficient alpha, $\alpha = \dfrac{n}{n-1}\cdot\dfrac{C}{V}$, which is a general form of the Kuder-Richardson coefficient. Alpha is an excellent measure of internal consistency (2).

The special formula for h is appropriate to sociometric data where each person ranks every other. Unlike the item-person matrix of test research, this matrix is square and has fixed row means. Moreover, the diagonal entry is ordinarily missing. This means that rows cannot be perfectly correlated. These properties are considered in Dr. Hohn's development. No study has been made of the degree to which, for computational purposes, h is superior or inferior to α or Kendall's W.

The suggestion that sociometric matrices be evaluated in terms of hierarchy is of general usefulness. While the h formula no longer applies, the same general technique may be used for matrices where the person reports (say) his three highest and three lowest choices. This type of index has several related interpretations.

(1) h is a measure of hierarchy. As h becomes closer to 1, the choice relations among a group approach a status system where each person prefers persons of high status and tends not to prefer persons below him. Groups which are divided into cliques will have a lower degree of hierarchy than groups which have a pyramidal system, but will not necessarily be more hierarchical than the group which has random distribution of choices. It may be important to study the conditions under which hierarchy develops, and the differences in performance of groups of different hierarchy.

(2) h is a measure of the extent to which persons constitute a scale in the quality being measured (in the Guttman sense). Just as Guttman can examine whether items can be arrayed in a continuum which is perceived similarly by all persons, so Hohn's index examines whether persons form such a unidimensional continuum. It is of interest to note that some hierarchies which satisfy Guttman's requirements for a scale are not extreme hierarchies in Hohn's definition.

(3) h is a measure of reliability. For the rectangular matrix where raters need not be ratees, Horst has shown that α provides a measure of reliability or agreement of raters (5). α estimates the correlation expected between sets of scores obtained from two samples of raters. h performs the same function for the square matrix of sociometric ranks. This appears to meet Pepinsky's demand for a measure of reliability of sociometrics, in the sense of consistency over judges (7). No splitting of the group into chance halves is required.

(4) h is a measure of communality of thinking among judges. If a group has a large h, the raters agree on their criteria and frames of reference. A low value of h indicates diversity in members' perceptions. Thus Gage and Exline (using the split-half technique) find greater agreement on ratings of others' productivity than on rating of the same persons' leisure time attractiveness (4). If a group has low internal consistency in ratings of "degree to which each person contributes to the aims of the group", this would suggest the presence of conflicting frames of reference and we might predict that such a group would be inefficient. Roby has done preliminary research of this kind. Studies of change in h over time might reflect development of a common reference frame, especially if h were determined separately for such dimensions as liking and contribution to the task.

We should note that, like α, h depends on the size of the group. It will tend to be larger in a larger group, other things being equal. Therefore h must be interpreted with the size of the group in mind, or some transformation will be required such as the "phi bar" index (2) derived from α.

A variant of conventional internal consistency analysis also may be profitably applied to sociometric data. The common item-test correlation has its analog in the correlation of any row with the marginal row, i.e., with the score structure. This is a measure of the extent to which the individual shares a frame of reference with other raters. One can similarly correlate the row with a row of criterion scores. Anderhalter, Wilkins, and Rigby (1) apply some of these approaches to Marine Officer Candidates, and show some evidence that the person who agrees with the marginal rating or with officers' ratings of the candidates, himself tends to receive a high rating. Homogeneity of these groups, as judged by the mean row-marginal correlation, increased with time.

Hohn's θ is not novel mathematically, being a direct application of product-moment technique. It does, however, draw attention to a possibly fruitful type of analysis which seems not to have been made except in studies of stability of sociometric scores over time. Consider its possible application to a bomber crew, where each man has a designated station. Then a crew where the navigator is rated high, and the flight engineer average, is in some respects different from a crew where the reverse is true. Applying θ to the score structures of many crews would yield a correlation matrix which could perhaps be separated into several types of structure. It is reasonable to suppose that these structures might be significant either as reflections of values within the crew, or as communication networks which influence effectiveness.